CDS

Accession Number TCMCG075C29307
gbkey CDS
Protein Id XP_007009529.2
Location join(2916071..2916664,2916756..2917316)
Gene LOC18586229
GeneID 18586229
Organism Theobroma cacao

Protein

Length 384aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007009467.2
Definition PREDICTED: ankyrin-1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category V
Description Ankyrin repeat-containing protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03036        [VIEW IN KEGG]
KEGG_ko ko:K21435        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0001101        [VIEW IN EMBL-EBI]
GO:0002376        [VIEW IN EMBL-EBI]
GO:0006950        [VIEW IN EMBL-EBI]
GO:0006952        [VIEW IN EMBL-EBI]
GO:0006955        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0009719        [VIEW IN EMBL-EBI]
GO:0009725        [VIEW IN EMBL-EBI]
GO:0009751        [VIEW IN EMBL-EBI]
GO:0010033        [VIEW IN EMBL-EBI]
GO:0014070        [VIEW IN EMBL-EBI]
GO:0042221        [VIEW IN EMBL-EBI]
GO:0042493        [VIEW IN EMBL-EBI]
GO:0045087        [VIEW IN EMBL-EBI]
GO:0046677        [VIEW IN EMBL-EBI]
GO:0050896        [VIEW IN EMBL-EBI]
GO:1901700        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGATGAAAGGTTGAGTGGTGCAGCTCTATCAGGAAATATAGATGCCTTGTATGATTTAATCAAAGATGATGCGGATGTTTTACGACGCATCGATGAGATGGAGTTCGTTGATACTCCACTGCACATAGCTGCAGCTGCAGGGCACACCGAGTTTGCAATGGAGTTGATGAACTTAAAGCCATCATTCGCTAGGAAGCTCAACCAATGCGGCTTTAGCCCCATTTACCTAGCCTTGCAAGAGAAACAAGAAAAGATGGTGGATGATCTCCTATCAATTGATAAAGATCTCGTTCGCGTCAAAGGGAGGGAGGGTTACACTCCTCTTCATCATGCAGTCAGAGAAGGAAATGTTCCGCTTCTGTCTAAATTTCTGGAGAATTGCCCCAATTCTATCTTTGATTTGACTATTCGAAAAGAGACTGCTCTGCATATTGCAGCACAAAATAATAATTTAGAAGCTTTCGAAGCCATACTGTTTTGGATTCACAAGACCCACGAATACGACTACATGGAGAAAAGAAGAATCCTAAACTTACAGGACAAGGATGGAAACACTGTGCTGCACATGGCCGCATCAAATAACCAAACCCAGATGATGAAACTGTTAATGGAAAGCAGGATGGTTAAGGGGGATAAGGTTAATCAAAGTGGTTTTACAGCTTTGCGTGTCTTACAAGAACAAGCTCGAGTTGACAGCGCAGAGAGTGTGAACATTCTGAAACCGCCTAAAGAGTACCGGATGGATTTTGGCCAAGTGTCATATGATATTTCGAAGATGAATCTTGATACAATCAATGCGTTGCTAGTCGTATTTGCTCTGATTGTAACGATGACTTACCAAGCTCTTCTCAGCCCGCCGGGTGGAATTGAAGCTGCGGGGAAGTCAGTCATCAAGCCTAACGTGTTTATTTTGTTCTATACTTTAAATATCGCAGCTTTTGGAATTGCGTGGTTTTCAGCAGTGTTCATCATCAAGACAGTTGCCAACAAAATCGCAGGTTATGTGGTTATACTGTTCTCATTGATCTGCTTGTGCTACATTGTTGCACATTTTATCATAGCGCCAACGCTACATGTCGGTGGGGTTGCTTTTGCTGCTGCTTTCATAATTGGCAGCATCCTTGCTATAATGGTTCATGTATCCATCGCATAA
Protein:  
MDERLSGAALSGNIDALYDLIKDDADVLRRIDEMEFVDTPLHIAAAAGHTEFAMELMNLKPSFARKLNQCGFSPIYLALQEKQEKMVDDLLSIDKDLVRVKGREGYTPLHHAVREGNVPLLSKFLENCPNSIFDLTIRKETALHIAAQNNNLEAFEAILFWIHKTHEYDYMEKRRILNLQDKDGNTVLHMAASNNQTQMMKLLMESRMVKGDKVNQSGFTALRVLQEQARVDSAESVNILKPPKEYRMDFGQVSYDISKMNLDTINALLVVFALIVTMTYQALLSPPGGIEAAGKSVIKPNVFILFYTLNIAAFGIAWFSAVFIIKTVANKIAGYVVILFSLICLCYIVAHFIIAPTLHVGGVAFAAAFIIGSILAIMVHVSIA